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(54) IVlethod and apparatus for providing reduced cost online service and adaptive targeting of 
advertisements 



(57) A nnethod and apparatus for adaptively target- 
ing advertisements to a specific client computer from a 
server within a distributed data processing system is 
provided. As a user of the client browses the World Wide 
Web, the material that is downloaded to the client con- 
stitutes a datastream. At some location during the rout- 
ing of the datastream, either on the server or at the cli- 
ent, the datastream is scanned to generate a list of key- 
words that ar e present within the datastrea m. The da- 
tastream may be analyzed in real-time or cached and 
analyzed on a delayed basis. The generated list of key- 
words represents a summary of the content that ap- 
pears to be the focus of interest of the user. The key- 



words are c ompared against a database of a dvertise- 
ments, and the server selects an advertisement tlTat 
matches the user's area of interest in comparison to the 
analysis of the user's browsing history The selected ad- 
vertisement is then inserted into the datastream to be 
routed to the client. In consideration for viewing targeted 
advertisements and to entice a Web viewer to allow the 
monitoring of a datastream so that targeted advertise- 
ments may be placed into the datastream, a Web viewer 
may receive online connection service for free, tor a re- 
duced cost, at a premium level of service, or for other 
some other value, such as frequent viewer credits that 
may be exchanged for goods and services. 
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Description 

BACKGROUND OF THE iNVENTiON 

1. Technical Field: 

[0001] The present invention relates generally to an 
improved data processing system and, in particular, to 
a method and apparatus for monitoring and processing 
a datastream during a commercial network connection. 

2. Description of Related Art: 

[0002] The World Wide Web (WWW. also known sim- 
ply as "the Web") is an abstract cyberspace of informa- 
tion that is physically transmitted across the hardware 
of the Internet. In the Web environment, servers and cli- 
ents communicate using Hypertext Transport Protocol 
(HTTP) to transfer various types of data files. Much of 
this information is in the form of Web pages identified 
by unique Uniform Resource Locators (URLs) or Uni- 
form Resource Identifiers (URIs) that are hosted by 
servers on Web sites. The Web pages are often format- 
ted using Hypertext Markup Language (HTML), which 
is a file format that is understood by software applica- 
tions, called Web browsers. A browser requests the 
transmission of a Web page from a particular URL, re- 
ceives the Web page in return, parses the HTML of the 
Web page to understand its content and presentation 
options, and displays the content on a computer display 
device. By using a Web browser, a user may navigate 
through the Web using URLs to view Web pages. 
[0003] As the Web continues to increase dramatically 
in size, corporations and organizations have attempted 
to promote themselves as Web portals or, following the 
analogy of the Internet and the Web has being an infor- 
mation superhighway, as an on-ramp to the Web. By in- 
dexing and collecting vast amounts of information re- 
sources within one Web site, a Web portal attempts to 
become a user's favorite Web starting point for each 
navigation session of the Web. A user, knowing that he 
may be able to find almost ail desired information within 
a single Web site, may prefer to search and browse 
through the Web portal. A Web portal gains a commer- 
cial advantage through increased viewership by receiv- 
ing more advertisement revenue per Web page viewed. 
However, in its attempt to attract as many users as pos- 
sible, Web portals increasingly gather together a vast 
quantity of disparate content. The publication of Web 
pages is an unrestricted and uncoordinated process. 
The latest news, book reviews, amateur poetry, games, 
research, stock and bond prices quotes, chat rooms, 
groups, clubs, shareware, technical support, on-line 
shopping, etc., may be found on the Web. There is no 
ability to know beforehand the type of content that drew 
the user to the Web portal. 

[0004] While browsing the Web, a user may follow a 
hyperlink from one Web page to another Web page. The 



user may not know what type of content will be viewed 
upon the target or destination of the hyperlink because 
hyperlinks are usually small images or very short text 
strings that provide only a term or a few words that may 

5 interest the user based on th e context of the surrounding 
material. The content of the presented hyperlink is often 
a text string that merely provides the URL of the desti- 
nation Web page, and the URL is arbitrarily defined by 
the Web master of a Web site or by the author of the 

10 Web page. Hence, a user may follow a hyperlink without 
knowing what material is being retrieved, and the da- 
tastream of downloaded material may be difficult to clas- 
sify within a category of subject matter. 
[0005] A significant number of commercial Web sites 

15 lose money, and it the Web site generates a profit, the ' 
source of the profits are usually advertising revenues. 
Providing advertisement space on Web pages was one 
of the first moneymaking opportunities for owners of 
commercial Web sites or authors of Web pages. Banner 

20 advertisements at the top of Web pages are now ubiq- 
uitous. Commercial Web sites rely on such advertise- 
ments to generate positive income, and advertisers rely 
on this Web space for the opportunity to create brand- 
name recognition for their products and services. As 

25 noted above, Web portals attempt to increase their ad- 
vertisement revenue by increasing the number of 
viewed Web pages, so called "eyeball count". 
[0006] Advertisers also desire to pay for advertise- 
ment placement based on the demographics of Web 

30 viewers . However, many Web portals and Internet 
Service Providers (ISPs) protect user privacy and do not 
collect biographical or demographic information from 
the users that surf the various Web sites on their servers 
or across their domains. Hence, it is difficult for adver- 
ts tisers to target their advertisement expenditures based 
on Web viewer demographics. Advertisers may also de- 
sire to target advertisement expenditures indirectly 
based on the content being viewed and downloaded by 
the viewers of a Web page or Web portal. However, be- 

40 cause Web portals gather disparate content, it is difficult 
to ascertain the reasons why a user may be viewing par- 
ticular content. 

[0007] This situation is unlike placing advertisements 
on cable or network television. In that case, information- 
's al program guides about the content being carried on 
the various channels is widely disseminated well ahead 
of the time of broadcast of the content. Television view- 
ers are enticed to watch programs containing definable 
content. Based on consumer research, advertisers can 
50 ascertain which demographic segment of a population 
may be attracted to the content being displayed on the 
channel. Based on the resulting research data, adver- 
tisers will target particular advertisements to particular 
segments of the population based on the content of the 
55 broadcast. 

[0008] In the case of many Web viewers accessing 
and downloading content from a Web site or Web server, 
advertisers may have a comparatively difficult time tar- 
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geting each viewer with effective advertisements so that 
the advertiser's expenditures are used wisely. In addi- 
tion to targeting advertisements once a person is view- 
ing Web content, advertisers require a method for en- 
ticing Web viewers to view advertisements in a manner 
similar to enticing television viewers to view particular 
television program content and, hence, particular adver- 
tisements placed in that content. 
[0009] Therefore, it would useful to have a method for 
directing advertisements to particular segments of users 
that download material from particular Web portals in 
order to increase the efficiency of advertisement expen- 
ditures. It would be particularly advantageous to target 
each user uniquely without gathering biographical infor- 
mation about each particular user. It would also be ad- 
vantageous to entice Web viewers to view particular ad- 
vertisements. 

SUMMARY OF THE INVENTION 

[0010] According to a first aspect of the invention 
there is provided a method for providing online connec- 
tion service from a server to a client in a distributed data 
processing system, the method comprising the compu- 
ter-implemented steps of: inserting selected advertise- 
ments into a datastream for a session connecting the 
server with the client; and reducing a cost for the online 
connection service in response to the insertion of the 
selected advertisements into the datastream. 
Q [001 1] According to a second aspect of the invention 
there is provided a method for sending advertisements 
from a server to a client in a distributed data processing 
system, the method comprising the computer-imple- 
mented steps of: scanning, on the server, a datastream 
for a session connecting the server with the client; gen- 

ierating a list of keywords from content within the datast- 
ream; selecting advertisements based on the generated 
keyword list; and inserting the selected advertisements 
into the datastream. 

[0012] According to a third aspect of the invention 
there is provided a distnbuted data processing system 
for providing online connection service from a server to 
a client, the distributed data processing system compris- 
ing: inserting means for inserting selected advertise- 
ments into a datastream for a session connecting the 
server with the client; and reducing means for reducing 
a cost for the online connection service in response to 
the insertion of the selected advertisements into the da- 
tastream. 

0 [0013] According to a fourth aspect of the invention 
there is provided a distributed data processing system 
for sending advertisements from a server to a client, the 
distributed data processing system comprising: scan- 
ning means for scanning, on the server, a datastream 
for a session connecting the server with the client; gen- 

Ierating means for generating a list of keywords from 
content within the datastream; selecting means for se- 
lecting advertisements based on the generated keyword 



list; and Inserting means for inserting the selected ad- 
vertisements into the datastream. 
[0014] According to a fifth aspect of the invention 
there is provided a distributed data processing system 

5 for receiving advertisements at a client from a server, 
the distributed data processing system comprising: 
scanning means for scanning, on the client, a datast- 
ream for a session connecting the client with the server; 
generating means for generating a list of keywords from 

10 content within the datastream; sending means for send- 
ing the generated list of keywords to the server; and re- 
ceiving means for receiving advertisements within the 
datastream, wherein the advertisements comprise con- 
tent semanticallv related to the generated list of key- 

^5 words. 

[0015] According to a sixth aspect of the invention 
there is provided a computer program product in a com- 
puter readable medium for use in a distributed data 
processing system for providing online connection serv- 

20 ice from a server to a client, the computer program prod- 
uct comprising: first instructions for inserting selected 
advertisements into a datastream for a session connect- 
ing the server with the client; and second Instructions 
for reducing a cost for the online connection service in 

25 response to the insertion of the selected advertisements 
into the datastream. 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 [0016] The novel features believed characteristic of 
the invention are set forth in the appended claims. The 
invention itself, however, as well as a preferred mode of 
use, further objectives and advantages thereof, will best 
be understood by reference to the following detailed de- 

35 scription of an illustrative embodiment when read in con- 
junction with the accompanying drawings, wherein: 

Figure 1 is a pictorial representation of a distributed 
data processing system in which the present inven- 
40 tlon may be Implemented; 

Figure 2 is a block diagram depicting a data 
processing system, which may be implemented as 
a server in accordance with a preferred embodi- 
es nnent of the present invention; 

Figure 3 is a block diagram illustrating a data 
processing system which may be implemented as 
a client in accordance with a preferred embodiment 
50 of the present invention; 

Figure 4 is a flowchart depicting a distributed data 
processing system for adaptively targeting adver- 
tisements for a client using a datastream scanner 
55 within the server; 

Figure 5 is a block diagram depicting a distributed 
data processing system that adaptively targets ad- 
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vertisements for a client by scanning tlie datast- 
ream addressed to the client with a data processing 
unit within the client itself; 

Figure 6 is a flowchart depicting a process in which 
a client receives a datastream containing adaptively 
targeted advertisements; 

Figure 7 is a flowchart depicting the processing per- 
formed by a server that scans a datastream in order 
to insert adaptively targeted advertisements ad- 
dressed to a client; 

Figure 8 is a flowchart describing a process in 
which a client scans an incoming datastream ad- 
dressed to the client in order to generate browse 
history information that is transmitted back to a 
server; 

Figure 9 is a flowchart depicting a process in which 
a server receives browse history Information from a 
client and inserts adaptively targeted advertise- 
ments into the datastream sent back to the client; 

Figure 10 is a flowchart depicting the process by 
which a server selects adaptively targeted adver- 
tisements based on the browsing history of a client; 
and 

Figure 11 is a flowchart depicting. a process by 
which a server receives electronically published ad- 
vertisements and processes these advertisements 
so that they may be retrieved at some later point as 
adaptively targeted advertisements. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENT 

[001 7] With reference now to the figures, Figure 1 de- 
picts a pictorial representation of a distributed data 
processing system in which the present invention may 
be implemented. Distributed data processing system 
100 is a network of computers in which the present In- 
vention may be implemented. Distributed data process- 
ing system 100 contains a network 102, which is the me- 
dium used to provide communications links between 
various devices and computers connected together 
within distributed data processing system 100, Network 
102 may include permanent connections, such as wire 
or fiber optic cables, or temporary connections made 
through telephone connections. 
[0018] In the depicted example, a server 104 is con- 
nected to network 102 along with storage unit 106. In 
addition, clients 108, 110, and 112 also are connected 
to a network 102. These clients 108, 110, and 112 may 
be, for example, personal computers or network com- 
puters. For purposes of this application, a network com- 
puter is any computer, coupled to a network, which re- 



ceives a program or other application from another com- 
puter coupled to the network. In the depicted example, 
server 104 provides data, such as boot files, operating 
system images, and applications to clients 108-112. Cli- 
s ents 1 08, 1 1 0, and 1 1 2 are clients to server 1 04. Distrib- 
uted data processing system 1 00 may include additional 
servers, clients, and other devices not shown. In the de- 
picted example, distributed data processing system 100 
is the Internet with network 102 representing a world- 
^0 wide collection of networks and gateways that use the 
TCP/IP suite of protocols to communicate with one an- 
other. At the heart of the Internet is a backbone of high- 
speed data communication lines between major nodes 
or host computers, consisting of thousands of commer- 
cial, government, educational and other computer sys- 
tems that route data and messages. Of course, distrib- 
uted data processing system 100 also may be imple- 
mented as a number of different types of networks, such 
as for example, an intranet, a local area network (LAN), 
or a wide area network (WAN). Figure 1 is intended as 
an example, and not as an architectural limitation for the 
present invention. 

[0019] Referring to Figure 2, a block diagram depicts 
a data processing system, which may be implemented 
as a server, such as server 104 in Figure 1, in accord- 
ance with a preferred embodiment of the present inven- 
tion. Data processing system 200 may be a symmetric 
multiprocessor (SMP) system including a plurality of 
processors 202 and 204 connected to system bus 206. 
Alternatively, a single processor system may be em- 
ployed. Also connected to system bus 206 is memory 
controller/cache 208, which provides an Interface to lo- 
cal memory 209. I/O bus bridge 21 0 is connected to sys- 
tem bus 206 and provides an interface to I/O bus 212. 
Memory controller/cache 208 and I/O bus bridge 210 
may be integrated as depicted. 
[0020] Peripheral component interconnect (PCI) bus 
bridge 214 connected to I/O bus 212 provides an inter- 
face to PCI local bus 216. A number of modems maybe 
connected to PCI bus 216. Typical PCI bus implemen- 
tations will support four PCI expansion slots or add-in 
connectors. Communications links to network comput- 
ers 108-112 in Figure 1 may be provided through mo- 
dem 21 8 and network adapter 220 connected to PCI lo- 
cal bus 21 6 through add-in boards. 
[0021] Additional PCI bus bridges 222 and 224 pro- 
vide interfaces for additional PCI buses 226 and 228, 
from which additional modems or network adapters may 
be supported. In this manner, server 200 allows connec- 
tions to multiple network computers. A memory -mapped 
graphics adapter 230 and hard disk 232 may also be 
connected to I/O bus 212 as depicted, either directly or 
indirectly. 

[0022] Those of ordinary skill in the art will appreciate 
that the hardware depicted In Figure 2 may vary. For 
example, other peripheral devices, such as optical disk 
drives and the like, also may be used in addition to or in 
place of the hardware depicted. The depicted example 
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is not meant to imply arcliitectural limitations with re- 
spect to the present invention. 

[0023] The data processing system depicted in Fig- 
ure 2 may be, for example, an IBM RISC/System 6000 
system, a product of International Business Machines 
Corporation in Armonk, New York, running the Ad- 
vanced Interactive Executive (AIX) operating system. 
[0024] With reference now to Figure 3, a block dia- 
gram illustrates a data processing system in which the 
present invention may be implemented. Data process- 
ing system 300 is an example of a client computer. Data 
processing system 300 employs a peripheral compo- 
nent interconnect (PCI) local bus architecture. Although 
the depicted example employs a PCI bus, other bus ar- 
chitectures such as Micro Channel and ISA may be 
used. Processor 302 and main memory 304 are con- 
nected to PCI local bus 306 through PCI bridge 308. PCI 
bridge 308 also may include an integrated memory con- 
troller and cache memory for processor 302. Additional 
connections to PCI local bus 306 may* be made through 
direct component interconnection or through add-in 
boards. In the depicted example, local area network 
(LAN) adapter 31 0, SCSI host bus adapter 312, and ex- 
pansion bus interface 314 are connected to PCI local 
bus 305 by direct component connection. In contrast, 
audio adapter 316, graphics adapter 31 8, and audio/vid- 
eo adapter 319 are connected to PCI local bus 306 by 
add-in boards inserted into expansion slots. Expansion 
bus interface 314 provides a connection for a keyboard 
and mouse adapter 320, modem 322, and additional 
memory 324. SCSI host bus adapter 312 provides a 
connection for hard disk drive 326, tape drive 328, and 
CD-ROM drive 330. Typical PCI local bus implementa- 
tions will support three or four PCI expansion slots or 
add-in connectors. 

[0025] An operating system runs on processor 302 
and is used to coordinate and provide control of various 
components within data processing system 300 in Fig- 
ure 3. The operating system may be a commercially 
available operating system such as OS/2, which is avail- 
able from International Business Machines Corporation. 
"OS/2" is a trademark of International Business Ma- 
chines Corporation. An object oriented programming 
system such as Java may run in conjunction with the 
operating system and provides calls to the operating 
system from Java programs or applications executing 
on data processing system 300. "Java" is a trademark 
of Sun Microsystems, Inc. Instructions for the operating 
system, the object-oriented operating system, and ap- 
plications or programs are located on storage devices, 
such as hard disk drive 326, and may be loaded into 
main memory 304 for execution by processor 302. 
[0026] Those of ordinary skill in the art will appreciate 
that the hardware in Figure 3 may vary depending on 
the implementation. Other internal hardware or periph- 
eral devices, such as flash ROM (or equivalent nonvol- 
atile memory) or optical disk drives and the like, may be 
used in addition to or in place of the hardware depicted 



in Figure 3. Also, the processes of the present invention 
may be applied to a multiprocessor data processing sys- 
tem. 

[0027] For example, data processing system 300, if 

5 optionally configured as a network computer, may not 
include SCSI host bus adapter 31 2, hard disk drive 326, 
tape drive 328, and CD-ROM 330, as noted by dotted 
line 332 in Figure 3 denoting optional inclusion. In that 
case, the computer, to be properly called a client com- 

10 puter, must include some type of network communica- 
tion interface, such as LAN adapter 310, modem 322, 
or the like. As another example, data processing system 
300 may be a stand-alone system configured to be 
bootable without relying on some type of network com- 

15 munication interlace, whether or not data processing 
system 300 comprises some type of network communi- 
cation interface. As a further example, data processing 
system 300 may be a Personal Digital Assistant (PDA) 
device which is configured with ROM and/or flash ROM 

20 in order to provide nonvolatile memory for storing oper- 
ating system files and/or user-generated data. 
[0028] The depicted example in Figure 3 and above- 
described examples are not meant to imply architectural 
limitations. 

26 [0029] With reference now to Figure 4, aflowchart de- 
picts a distributed data processing system for adaptively 
targeting advertisements for a client using a datastream 
scanner within the server. Distributed data processing 
system 400 contains client 402 running browser appii- 

30 cation 404 that generates HTTP requests 406 that are 
sent to server 408. Client requests that are addressed 
to other servers on the Internet are routed to the Internet 
through interface 410.. If server 408 is able to fulfill the 
HTTP request message without routing the message to 

35 another server on the Internet, server 408 may access 
its own internal database and generate an HTTP re- 
sponse that is sent to client 402. Othenwise, server 408 
receives data from the Internet at physical interface 41 2 
and eventually routes this data to client 402 as HTTP 

40 response 414. 

[0030] Distributed data processing system 400 re- 
ceives electronically published advertisements from 
various customers and stores these advertisements in 
unprocessed advertisements database 41 6. Advertise- 
rs ment processing unit 418 retrieves the advertisements 
from unprocessed advertisements database 416, proc- 
esses these advertisements, and stores these adver- 
tisements in categorized advertisements database 420. 
The advertisements are processed by scanning the ad- 

50 vertisements for keywords that convey the gist of the 
content of each of the advertisements. These keyword 
lists are associatively stored with the advertisements in 
categorized advertisements database 420. 
[0031] Information about the advertisers may be 

55 stored in advertiser database 421 . Rather than deter- 
mining the gist of the content of each advertisement 
from the subject matter within the advertisement, an ad- 
vertiser may provide a set of keywords that capture a 
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wider context in which the advertiser believes the ad- 
vertiser's product or service belongs. For example, a car 
manufacturer may create an advertisement for an auto- 
mobile that does not use the words "car", "vehicle", or 
"automobile. " Thus, if a keyword list was generated from s 
the content of the advertisement, the generated key- 
word list may not capture or convey the context in which 
the advertiser would desire to have the automobile ad- 
vertisement displayed. Hence, the advertiser may cre- 
ate a keyword list tor a complete category of products to 
or services that target a particular demographic or par- 
ticular types of datastreams without relying on a key- 
word list generated from the advertisement's content. In 
this manner, the advertiser may target an automobile 
advertisement to someone that is browsing the Web for ^5 
certain types of luxury goods. 
[0032] Server 408 receives a datastream from the In- 
ternet that is directed or addressed to client 402 and is 
responsible for routing the datastream to the client. 
However, in a system that incorporates the present in- 20 
vention, server 408 processes the datastream to some 
extent before routing the datastream to client 402. Da- 
tastream scanner unit 422 scans the content of the da- 
tastream and generates browse history information 424 
that is sent to advertisement selector unit 426. The con- 25 
tent of the datastream comprises URIs and other infor- 
mation with HTTP messages, File Transport Protocol 
(FTP) packets, TCP/IP packets, etc. Alternatively, da- 
tastream scanner unit 422 may work in conjunction with 
optional upstream scanner unit 423 that scans the HTTP 50 
request messages from the client. Information concern- 
ing the client requests may be combined with the infor- 
mation gleaned from the downloaded datastream, and 
the combined information may be fonvarded to adver- 
tisement selector unit 426. ^5 
[0033] Advertisement selector unit 426 also receives 
session characterization information 428 containing in- 
formation about a particular session or connection be- 
tween server 408 and client 402. In other words, as a 
connection is made between the client and the server, 4o 
Information is stored and controlled by the server that 
describes various parameters about the connection. 
The session characterization information may include 
different types of information, such as: the type of com- 
puter platform that is being utilized as the client compu- ^5 
ter; the type and version number of client software or 
browser software that is being used to receive the down- 
loaded datastream; the current time of day for the ses- 
sion between the client arid the server, the current sea- 
son of the year in which the session is being held; the so 
amount of available connection bandwidth for the ses- 
sion; the effective download speed of the client; the 
amount of connection time for the session; the number 
of bytes, the number of Web pages, the number of files, 
or other content metrics concerning the material that has 
been downloaded to the client; a list of advertisements 
inserted into the monitored datastream; the number of 
advertisements either sent or targeted to the client; and 



the geographic location of the client. Advertisement in- 
serter unit 430 scans and monitors the datastream ad- 
dressed to client 402 to determine appropriate points in 
the datastream for Inserting adaptively targeted adver- 
tisements to the user of browser application 404. Adver- 
tisement inserter unit 430 generates a request that is 
sent to advertisement selector unit 426. 
[0034] Advertisement selector unit 426 uses browse 
history information 424 and session characterization in- 
formation 428 to select an adaptively targeted advertise- 
ment from categorized advertisements database 420. If 
advertisement selector unit 426 decides that catego- 
rized advertisements database 420 does not have a 
suitable advertisement for a relative match with browse 
history information 424 and session characterization in- 
formation 428, advertisement selector unit 426 may se- 
lect an advertisement from advertiser database 421 . 
[0035] Advertisement selector unit 426 sends the se- 
lected advertisement to advertisement inserter unit 430. 
which places the advertisement into the appropriate 
point in the datastream and forwards the modified da- 
tastream or HTTP response message to datastream 
scanner unit 422. 

[0036] In this manner, datastream scanner unit 422, 
advertisement selector unit 426, and advertisement in- 
serter unit 430 work in conjunction to mon Itor th e datast- 
ream directed or requested by the user of browser ap- 
plication 404 so that server 408 may determine appro- 
priate points within the datastream for insertion of target 
advertisements and then placing carefully selected ad- 
vertisements in those locations within the datastream. 
The datastream may be stored as a cached datastream 
so that datastream scanner unit 422 does not scan in 
real-time while the datastream is flowing through the 
server. At appropriate or predetermined intervals, da- 
tastream scanner unit 422 reports the browse history in- 
formation using statistics gathered from the cached da- 
tastream. The choice of using a cached datastream may 
depend upon the processing speed of datastream scan- 
ner unit 422. the bandwidth of the datastream, etc. 
[0037] The relative placement of advertisement in- 
serter unit 430 and datastream scanner unit 422 at suc- 
cessive positions along the datastream, may be re- 
versed without affecting the ability of server 408 to se- 
lectively target advertisements to a client system. In the 
example shown, advertisement inserter unit 430 places 
selected advertisements within the datastream that is 
then scanned by datastream scanner unit 422. By plac- 
ing the advertisements into the datastream before the 
datastream is scanned, the advertisement that has been 
placed into the datastream becomes part of browse his- 
tory information 424. If the inserted advertisement is 
properly selected by advertisement selector unit 426, 
the inserted advertisement will blend with the content of 
the datastream so that the advertisement does not skew 
the results generated by datastream scanner unit 422. 
If it is determined by the empirical results that the inser- 
tion of advertisements into the datastream skew the 
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generated results from the scan of the datastreann, a 
feedback mechanisnn nnay be employed in which adver- 
tisement selector unit 426 also provides the selected ad- 
vertisement to datastream scanner unit 422 so that the 
insertion of an advertisement Into the datastream does 
not skew the scanned results generated by datastream 
scanner unit 422. 

[0038] In an alternative embodiment, datastream 
scanner unit 422 scans the datastream before the inser- 
tion of an advertisement by advertisement inserter unit 
430. In this case, the results generated by datastream 
scanner unit 422 will represent the content actually re- 
quested by the user of browser application 404, i.e., the 
generated results will represent the datastream without 
advertisements. However, in situations in which adver- 
tisements constitute a large percentage of the volume 
of the data in the datastream addressed to client 402, 
the generated results may represent only a small portion 
of the datastream. In other words. If the user of browser 
application 404 is browsing through Web pages that 
contain a lot of advertisements, and many advertise- 
ments are being Inserted into the datastream, then the 
generated results of datastream scanner unit 422 may 
not accurately represent the content of the advertise- 
ments in the browsed history information 424. 
[0039] Underlying the ability to direct targeted adver- 
tisements to a viewer Is the assumption that a user may 
have agreed to receive such advertisements in return 
for some type of valuable consideration. At some prior 
point in time, the viewer has set up an account with an 
online service provider, or Internet service provider 
(ISP), or the equivalent. At that time, the viewer may 
have been informed that the ISP would provide an In- 
centive to the viewer to receive targeted advertise- 
ments. The viewer may accept or decline the offer at 
that time, but the viewer may change this preference at 
some time In the future, either by selecting a function 
within the browserthat notifies the server that the viewer 
has changed this preference or by submitting an elec- 
tronic form or Web page to notify the server of the 
change In preference. The viewer may toggle this pref- 
erence on/off multiple times during a single session 
through a configuration request. 
[0040] Upstream scanner unit 423 may monitor the in- 
coming messages from the client for configuration re- 
quests from the client. The server uses these configu- 
ration requests as part of the session characterization 
information In order to determine whether to place tar- 
geted advertisements into the viewer's datastream. 
[0041] In return for receiving these advertisements, 
the viewer may receive free connection time from the 
ISP, which the ISP can provide because the ISP collects 
revenue from advertisers who expect that the ISP will 
provide a service for adaptively targeting these adver- 
tisements to viewers. The ISP earns a profit because 
the revenue from the advertisers is greater than the con- 
nection fees that could be collected from the viewers 
that connect to the ISP. 



[0042] The fee schedule between the ISP and the 
viewer may be an all-or-nothing arrangement or a grad- 
uated scale. The viewer may receive free connection 
time In return for receiving only targeted aclvertise- 

5 ments. Alternatively the viewer may receive some value 
for receiving a certain number of targeted advertise- 
ments, and the viewer receives more value for a larger 
number of viewed advertisements. By monitoring the 
number of advertisements that the server has transmlt- 

10 ted to the client or by monitoring the amount of time that 
the viewer has been connected while viewing targeted 
advertisements, the ISP can appropriately compute the 
monetary value to be credited to the viewer's account. 
The number of targeted advertisements that have been 

75 sent to the client can be used as part of the session char- 
acterization information. For example, a viewer may re- 
ceive connection time at a reduced cost that Is Inversely 
proportional to the number of viewed advertisements. 
As more advertisements are viewed, the ISP gradually 

20 reduces the cost for the viewer's next month of online 
connection time through the ISP 
[0043] In lieu of receiving reduced cost connection 
time, a viewer may receive frequent viewer credits that 
may be exchanged for goods or services. The viewer 

25 may exchange the credits for goods or services from a 
particular advertiser in return for viewing advertise- 
ments from the advertiser, or the viewer may exchange 
the credits for goods and services from the ISP. 
[0044] With reference now to Figure 5, a block dia- 

30 gram depicts a distributed data processing system that 
adaptively targets advertisements for a client by scan- 
ning the datastream addressed to the client with a data 
processing unit within the client Itself. Distributed data 
processing system 500 contains client 502 running 

35 browser application 504 that generates HTTP requests 
506 that are sent to server 508. Client requests that are 
addressed to other servers on the Internet are routed to 
the Internet through interface 510. If server 508 is able 
to fulfill the HTTP request message without routing the 

40 message to another server on the Internet, server 508 
may access its own internal database and generate an 
HTTP response that Is sent to client 502. Otherwise, 
server 508 receives data from the Internet at physical 
Interface 512 and eventually routes this data to client 

45 502 as HTTP response 514. 

[0045] Distributed data processing system 500 re- 
ceives electronically published advertisements from 
.various customers and stores these advertisements in 
unprocessed advertisements database 516. Advertlse- 

50 ment processing unit 518 retrieves the advertisements 
from unprocessed advertisements database 516, proc- 
esses these advertisements, and stores these adver- 
tisements In categorized advertisements database 520. 
Advertiser database 521 stores Information about the 

55 advertisers In a manner similar to advertiser database 
421, described with respect to Figure 4. 
[0046] As previously described above, the location of 
advertisement Inserter unit 430 and datastream scanner 
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unit 422 respectively along the data flow path for the 
datastream addressed to client 402 may be reversed or 
separated without affecting the ability of server 408 to 
adaptively target advertisements to the user of browser 
application 404 in Figure 4. Figure 5 describes an ex- 
ample in which the present invention is partially embed- 
ded within browser application 504 in client 502. 
[0047] The function of scanning the datastream di- 
rected to the user of browser application 504 is per- 
formed by datastream scanner unit 534. Datastream 
scanner unit 534 scans the datastream to generate 
browse history information concerning the content re- 
ceived at client 502 from the browsing activities of the 
user The browse history information generated by da- 
tastream scanner unit 534 is pushed to server 508 by 
browser 504. The periodicity of the push events may de- 
pend on the amount of data downloaded by client 502, 
the bandwidth limitations of client 502, a predetermined 
schedule set by server 508 for receiving the browse his- 
tory information, etc. 

[0048] Once server 508 receives HTTP and browse 
history information 506, server 508 routes HTTP re- 
quest onto the Internet while keeping browse history in- 
formation 532 that is filtered by optional upstream scan- 
ner unit 523 which scans the incoming requests from 
the client. Browse history information 532 may be em- 
bedded in HTTP messages or embedded in some other 
type of data packet communicated to server 508 in a 
variety of different protocols. Upstream scanner unit 523 
directs browse history information 532 to advertisement 
selector unit 526 that processes and stores browse his- 
tory information 532 for each session or connection to 
server 508. Advertisement selector unit 526 also re- 
ceives session characterization information 528 that de- 
scribes each particular session currently being served 
by server 508. Advertisement inserter unit 530 scans 
and monitors the datastream addressed to client 502. 
At appropriate locations within the datastream, adver- 
tisement inserter unit 530 determines that an adaptively 
targeted advertisement should be placed into the da- 
tastream and requests an advertisement from adver- 
tisement selector unit 526. 

[0049] Advertisement selector unit 526 uses browse 
history information 532 and session characterization in- 
formation 528 to select an advertisement from charac- 
terized advertisement database 520. Advertisement se- 
lector unit 526 then sends the selected advertisement 
to advertisement Inserter unit 530 so that the selected 
advertisement may be inserted into the datastream and 
addressed to client 502. If advertisement selector unit 
526 decides that categorized advertisements database 
520 does not have a suitable advertisement for a relative 
match with browse history Information 524 and session 
characterization information 528, advertisement selec- 
tor unit 526 may select an advertisement from advertiser 
database 521. 

[0050] In this manner, datastream scanner unit 534, 
advertisement inserter unit 530, and advertisement se- 



lector unit 526 collaborate to monitor the content of the 
datastream directed to the user of browser application 
504. The results of the monitoring of the datastream are 
pushed from the client to the server The server then 
s selects an advertisement that will be specifically target- 
ed to the user of a browser application on the client in a 
manner such that the advertisement appears relevant 
to the content of the rest of the datastream being viewed 
by the user 

10 [0051] In an alternative embodiment, the advertise- 
ment inserter unit 530 may embed additional advertise- 
ments into the datastream other than the advertise- 
ments that are to be immediately shown to the viewer 
at the client In the current Web page. These additional 

75 advertisements may be placed within special markup 
tags that are not displayed by the browser on the client. 
The additional advertisements are cached, however, 
with the other material constituting the Web page. Dur- 
ing periods of inactivity or idle periods during which the 

20 viewer is not actively using the browser, these adver- 
tisements may be shown by the browser on the display 
device of the client. 

[0052] With referencenowtoFigure6, a flowchart de- 
picts a process in which a client receives a datastream 

25 containing adaptively targeted advertisements. Figure 
6 may perform processing on the client as shown pre- 
viously with respect to client 402 in Figure 4. The proc- 
ess begins when a user of a browser application selects 
a hyperlink or Inputs a Uniform Resource Identifier (URI) 

30 Into the client browser (step 602). The client then sends 
HTTP request messages to the server (step 604). At 
some point, the client receives HTTP response messag- 
es from the server In response to the previously trans- 
mitted HTTP request messages, and these HTTP re- 

35 sponse messages contain adaptively targeted adver- 
tisements (step 606). The client browser then displays 
the user requested Information with these advertise- 
ments (step 608). 

[0053] With reference now to Figure 7, a flowchart de- 

40 picts the processing performed by a sen/er that scans 
a datastream in order to insert adaptively targeted ad- 
vertisements addressed to a client. Figure 7 shows 
processing that may be performed on a server similar 
to server 408 In Figure 4. The process begins when the 

45 server receives HTTP request messages from a client 
(step 702). The server routes the HTTP request mes- 
sages onto the Internet if necessary (step 704). Some 
HTTP requests may be directed to URIs within or serv- 
iced by the server, in which case the server may provide 

50 the information directly without routing the HTTP re- 
quest onto the Internet. The server receives HTTP re- 
sponse messages that are to be routed to the client in 
response to the previously routed HTTP request mes- 
sages (step 706). The server then inserts adaptively tar- 

55 geted advertisements into appropriate locations within 
the HTTP response messages that constitute the da- 
tastream addressed to the client (step 708). A datast- 
ream scanner unit within the server scans the datast- 
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ream for keywords in order to generate a user session 
specific list of keywords (step 710). 
[0054] The datastream scanner unit may generate a 
list of keywords in a variety of manners. Although the 
method generally entails scanning the datastream for a 
set of keywords and compiling a count of the individual 
keywords, several different criteria or statistical meth- 
ods may be applied to the occurrence counts to deter- 
mine which keywords should be included as a repre- 
sentative set of keywords for the browse history infor- 
mation. An important point is that many different types 
and sets of rules may be applied to the analysis of the 
datastream in order to determine a general trend of in- 
terest of the user in the material being viewed. Over 
time, this analysis will generate a shifting set of key- 
words to be included as relevant information in the 
browse history information. 

[0055] One method of generating a list of keywords 
encompasses using a moving window of content for 
analysis within the datastream. In other words, a certain 
subset of the datastream is analyzed to generate the 
necessary list of keywords. This moving window of con- 
tent may be determined in several ways. The first meth- 
od of determining the moving window of content is to 
analyze a predetermined number of downloaded Web 
pages as the window of content that should be analyzed 
by the datastream scanner unit. As time passes by, the 
window of content is a predetermined number of the 
most recently downloaded Web pages. A second meth- 
od of determining the size of the moving window of con- 
tent Is the application of a variable number of bytes as 
the measuring standard of the size of the content win- 
dow. This variable size may be set as a function of sev- 
eral parameters, such as the size of the keyword list, the 
amount of data downloaded by the user, etc. 
[0056] An advertisement selector unit in the server 
stores the browse history information received from the 
datastream scanner unit for each client session (step 
712). The browse history information may include other 
information in addition to a generated list of keywords, 
such as URIs that identify the origin of the information 
constituting the datastream. This information is stored 
for later retrieval when the advertisement selector unit 
selects an appropriate advertisement to be sent to the 
client. The server then routes the modified HTTP re- 
sponse messages to the client (step 714). 
[0057] With reference now to Figure 8, a flowchart de- 
scribes a process in which a client scans an incoming 
datastream addressed to the client in order to generate 
browse history information that is transmitted back to a 
server. The process depicted in Figure 8 may be imple- 
mented by a client such as client 502 depicted in Figure 
5. The process begins when the user selects a hyperlink 
or inputs a URI into the client browser (step 802). The 
client then sends HTTP request messages to the sen/er 
along with browsing history information for the current 
session (step 804). The client then receives HTTP re- 
sponse messages from the server containing the previ- 



ously requested information and including adaptively 
targeted advertisements placed into the HTTP response 
messages at appropriate locations in response to the 
browse history information previously sent from the cli- 

s ent to the sen/er (step 806). The client browser then dis- 
plays the user requested information (step 808). 
[0058] The methods by which the datastream scanner 
unit operates in Figures 7 and 8 may vary depending 
on a variety or combination of algorithms used to ana- 

^0 lyze the datastream. A datastream may be comprised 
of various types of information and information objects. 
When a datastream scanner unit scans a datastream, 
each portion of information in the datastream may be 
analyzed separately. For example, in a datastream con- 

^5 sisting primarily of HTML Web pages, hyperlinks and 
markup tags may be analyzed separately from the AS- 
CII text in the body of the pages. 
[0059] As one example of the various processes that 
may be used to analyze a datastream, an adaptive 

20 weighting scheme may be employed that weights par- 
ticular portions of the datastream more heavily than oth- 
er portions, i.e. certain portions are considered to be 
more important than other portions In the determination 
of the gist of interest of the viewer. Hyperlink tags or text 

25 embodied inside HTML tags may be weighted more 
heavily than other types of text. If a viewer of a datast- 
ream were reading about swimsuits, and several of the 
links in the Web page contained URIs pointing to sum- 
mer sports, island vacations, etc., then the scanner unit 

30 may weight the text in these links more heavily than oth- 
er text. When the generated keyword list is used, adver- 
tisements will be chosen that are more directed to the 
content within the hyperlinks than in the body of the text. 
If a person is viewing Web pages by clicking through the 

35 hyperlinks, the advertisements which appear on the 
Web pages with these hyperlinks will possibly closely 
match the content that may appear on the Web pages 
that the person is preparing to view when a hyperlink is 
subsequently selected. 

40 [0060] More particularly, if the scanner has analyzed 
the tags and found a word or set of words within URIs 
in the datastream, advertisers may be requested to pay 
higher advertisement rates to the ISP operating the 
server to insert particular advertisements targeted to 

45 particular URIs. Continuing with the previous example, 
a swimsuit manufacturer may pay a higher fee so that 
an advertisement for a swimsuit appears each time that 
a ,URI appears in the datastream containing the word 
stem "Island," e.g., "http://www.greatvacations.com/is- 

50 lands." More complex formulas may be created for a 
more complex fee schedule. The session characteriza- 
tion information may be combined with the generated 
keyword list to form an effective manner of targeting ad- 
vertisements. Using the previous example, the swimsuit 

55 manufacturer may be more willing to pay higher rates 
for the appearance of a URI with the word stem "island" 
if the session characterization information indicates that 
the person viewing the datastream lives in Hawaii rather 



6/29/2005, EAST Version: 2. 0. 1.-4 



17 



EP 1 061 465 A2 



18 



than Montana. Other information items that may be ob- 
tained in the session characterization information is ex- 
plained in more detail further below. 
[0061] Adaptively targeting advertisements through 
the use of URIs may be especially effective when the s 
scanning Is performed on the outgoing datastream, i.e., 
the datastream flowing from the client to the server and 
routed onto the Internet. The URIs in the HTTP request 
messages are the particular objects that a viewer has 
selected as being of Immediate interest. When the 
words within these URIs are weighted more heavily than 
other words in the datastream, the generated keyword 
list can be more finely focused on the subsequently re- 
trieved datastream, and the advertisements inserted in- 
to the datastream can be effectively targeted to the view- 
er's interest. 

[0062] In addition to URIs. another portion of a datast- 
ream that may be weighted heavily is metadata or 
metainformation. Metadata is data that describes anoth- 
er, associated data item, i.e. information that provides 
characteristics about other information. Web pages con- 
structed with HTML frequently contain META tags that 
contain metadata about the Web page in which the ME- 
TA tags are embedded. Since the content within META 
tags is not presented to the viewer of the Web page, 
these tags can contain any information that the author 
desires to store in a hidden manner. Some commercial 
Web search engines use the content within META tags 
for indexing a Web page. 

[0063] In general, META tags contain a list of key- 
words and a short description of the content of the Web 
page, similar to an abstract. These keywords are cho- 
sen by the author or publisher of the Web page, and 
while the keywords may have been chosen arbitrarily, it 
may be assumed that the information within a META tag 
provides an accurate condensation of its associated 
content. Hence, content within a META tag would be an 
effective target for adaptive weighting. Since the datast- 
ream scanner unit attempts to generate a keyword list, 
a process that weights more heavily a keyword list or 
abstract that has already been embedded within a Web 
page than other portions of a Web page would be espe- 
cially effective for targeting an advertisement that close- 
ly matches the context of the datastream. 
[0064] Metadata may be placed in Internet -delivera- 
ble documents in a variety of manners. As another ex- 
ample besides HTML META tags, extensible Markup 
Language (XML) is being standardized to incorporate 
features for providing metadata. Meta Content Frame- 
work (MCF) and XML-Data are two proposals that incor- 
porate the idea of metainformation into XML. Continuing 
with the point addressed above that many different 
types and sets of rules may be applied to the analysis 
of the datastream, a datastream scanner unit may com- 
bine rules for a moving window with rules for adaptive 
weighting. One manner of combining rules would be to 
introduce a temporal aspect to the weighting of key- 
words. Keywords that have been recently generated by 



scanning a moving window of a datastream may be 
more heavily weighted than previously generated key- 
words. As previously generated keywords increase in 
age, their relative weighting diminishes. 
[0065] With reference now to Figure 9, a flowchart de- 
picts a process in which a server receives browse his- 
tory information from a client and inserts adaptively tar- 
geted advertisements into the datastream sent back to 
the client The process depicted in Figure 9 may be im- 
plemented within a server, such as server 508 shown in 
Figure 5. The process begins when the server receives 
HTTP request messages from the client (step 902). The 
server routes the HTTP request messages onto the In- 
ternet if necessary, or finds the requested information 
within the server (step 904). The server also receives 
periodic reports of browse history information from the 
client (step 906). The advertisement selector unit within 
the sen/er stores the browse history information for each 
client session (step 908). At some point, the server re- 
ceives HTTP response messages from the Internet to 
be routed to the client in response to the original HTTP 
request messages (step 910). The server then inserts 
adaptively targeted advertisements into appropriate lo- 
cations within the HTTP response messages (step 91 2). 
The server then routes the HTTP response messages 
to the client that include the adaptively targeted adver- 
tisements (step 914). 

[0066] With reference now to Figure 10, a flowchart 
depicts the process by which a server selects adaptively 
targeted advertisements based on the browsing history 
of a client. The process depicted in Figure 10 may be 
similar to the.process used by components within a serv- 
er, such as server 408 shown in Figure 4. 
[0067] The advertisement selector unit within the 
server receives a request from the advertisement insert- 
er unit for an advertisement to be placed in the datast- 
ream for a particular client session (step 1002). The ad- 
vertisement selector unit retrieves previously stored 
browse history information for the particular client ses- 
sion (step 1004). The advertisement selector unit also 
retrieves session characterization information (step 
1006). The advertisement selector unit then compares 
the list of keywords from the browse history information 
with previously stored keyword lists for categories of ad- 
vertisements stored in the categorized advertisement 
database (step 1008). The advertisement selector unit 
then compares the session characterization information 
against subcategory information for the set of advertise- 
ments in the previously selected category of advertise- 
ments (step 1010). 

[0068] The session characterization information may 
include types of information, such as: the type of com- 
puter platform that is being utilized as the client compu- 
ter; the current time of day for the session between the 
client and the server; the current season of the year in 
which the session is being held; the amount of available 
connection bandwidth for the session; and the geo- 
graphic location of the client computer. This type of in- 
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formation may narrow the choices tor selecting an ad- 
vertisement subcategory. For example, with a determi- 
nation of a computer platform, such as an IBM compat- 
ible PC or an Apple Macintosh computer, more artistic 
advertisements may be directed towards the user of the s 
Macintosh as the Macintosh is more prevalent within the 
desktop publishing industry With respect to the time of 
day, business type advertisements may be directed to 
the user during the day while leisure or entertainment 
type advertisements are directed to a user in the 
evening. With respect to the season in which the session 
occurs, seasonal advertisements may be directed to a 
user, such as advertisements tor skiing during the winter 
or advertisements for boating during the summer With 
respect to the available connection bandwidth, an ad- 
vertisement with simpler graphics may be directed to a 
user using a 56k modem while complex advertisements 
with MPEG movies may be directed to a user employing 
a cable modem. With respect to the location of the client 
computer, appropriate advertisement categories for the 
source of products and services may be determined 
based on whether the .client computer is located in the 
north versus the south, east versus west, or rural versus 
urban locations. 

[0069] The advertisement selector unit then selects 
an advertisement from the categorized advertisements 
database based on the results of these comparisons 
(step 1012). As noted above, advertisements may also 
be selected based on information in an advertiser data- 
base that generally describes the product or service of- 
fered by an advertiser or describes a general type of 
viewer that the advertiser is attempting to reach with tar- 
geted advertisements. The advertisement selector unit 
then sends the selected advertisement to the advertise- 
ment inserter unit (step 1014). The advertisement in- 
serter unit then inserts the selected advertisement as 
an adaptively targeted advertisement into a modified 
HTTP response message to be routed to the client as 
part of the datastream address to the client (step 1016). 
[0070] The process steps shown in Figure 10 may be 
modified so that the advertisement selection mecha- 
nism is essentially performed in the background and not 
in real time during the routing of the HTTP response 
messages constituting the datastream addressed to the 
client. In other words, when the advertisement selector 
unit receives the browse history information for a partic- 
ular client session, in addition to storing the browse his- 
tory information, the advertisement selector unit may 
begin the process of selecting an adaptively targeted 
advertisement for a client session without waiting for a 
request from the advertisement inserter unit. In this 
manner, the advertisement selector unit may have an 
appropriate advertisement or set of advertisements al- 
ready selected for a particular client so that the adver- 
tisement inserter unit is not stalled waiting for a re- 
sponse from the advertisement selector unit. 
[0071] Once the advertisement selector unit has se- 
lected an appropriate advertisement or advertisements 



for a particular client session, it may store identification 
information for the selected advertisement(s) or a point- 
er(s) into the categorized advertisement database along 
with the stored browse history information within the ad- 
vertisement selector unit. In this example, step 1002 in 
Figure 10 would be performed immediately before step 
1 01 4 so that any of the steps in the process for selecting 
an adaptively targeted advertisement have been previ- 
ously performed, and the advertisement inserter unit 
may receive a quick response for identifying and insert- 
ing an advertisement into the datastream to be routed 
to the client. 

[0072] With reference now to Figure 11, a flowchart 
depicts a process by which a server receives electroni- 
cally published advertisements and processes these 
advertisements so that they may be retrieved at some 
later point as adaptively targeted advertisements. Fig- 
ure 11 describes a process that may be performed by 
a data processing unit, such as advertisement process- 
ing unit 418 in Figure 4. 

[0073] The distributed data processing system that in- 
cludes the sen/er establishing the browser session re- 
ceives electronically published advertisements and 
stores them in a database containing unprocessed ad- 
vertisements (step 1102). At some point, the advertise- 
ment processing unit retrieves the electronically pub- 
lished advertisement previously stored in the database 
of unprocessed advertisements (step 1104). The adver- 
tisement processing unit then generates a list of key- 
words for the retrieved advertisement derived from the 
content within the retrieved advertisement (step 1106). 
The advertisement processing unit then categorizes 
and subcategorizes the retrieved advertisement based 
on its generated list of keywords (step 1108). The ad- 
vertisement processing unit then associatively stores 
the retrieved advertisement along with other electroni- 
cally published and processed advertisements in the 
proper category and subcategory (step 1110). The ad- 
vertisement selector unit on the server may later search 
and retrieve the processed advertisements from the cat- 
egorized advertisement database (step 1112). 
[0074] Thus, the present invention provides a method 
and apparatus for adaptively targeting advertisements 
to a particular user based on the user's browsing history. 
As the user browses the Internet, the user's datastream 
is analyzed for a set of keywords that represents a sum- 
mary of the content that appears to be the focus of the 
interest of the user. Advertisements may be selected 
based on the summary of the interest of the user, and 
the advertisement may be placed into the datastream In 
a manner in which the advertisement closely resembles 
its surrounding context. 

[0075] The present invention also provides a method 
for enticing Web viewers to look at adaptively targeted 
advertisements by giving something of monetary value 
to a viewer in exchange for looking at the advertise- 
ments. Once the viewer agrees to receive advertise- 
ments in a downloaded datastream, the online service 
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provider places advertisements into the viewer's datast- 
ream, and tine viewer receives something of value in re- 
turn, such as free online connection time, connection 
time at a reduced cost, or an acquired credit balance 
that may be applied to other purchases online. The on- 
line service provider or ISP receives revenues from ad- 
vertisers for this particular sen/ice, presumably at a 
higher profit than the revenues that could be collected 
in connection fees from viewers. By knowing which 
viewers are looking at which advertisements, an adver- 
tiser may efficiently use advertising expenditures to tar- 
get those viewers that are most likely to purchase the 
advertiser's products and services. 
[0076] The advantages of the present invention 
should be readily apparent in reference to the descrip- 
tion of the invention above. Without demographic infor- 
mation on the Web viewers visiting particular Web sites 
and domains, advertisers have merely relied upon the 
statistics related to the number of page views. An ad- 
vertiser may then pay for the placement of advertise- 
ments based on the perception of the page traffic in 
which their advertisements may be placed. Advertisers 
may have also chosen the placement of advertisements 
based on the overall commercial aspect of a Web site 
or Web portal. With the use of the present invention, ad- 
vertisements may be more intelligently directed towards 
the viewers of the advertisements in a manner in which 
a viewer may perceive an additional benefit by receiving 
advertisements that are related to a focus of interest. 
The present invention may provide the ability to direct 
advertisements more accurately than the method used 
in placing advertisements on broadcast or cable chan- 
nels. In those cases, consumer research must be con- 
ducted after the broadcast to determine demographics 
of viewers that watched a particular broadcast. Through 
the use of the present Invention, advertisements are 
blended into the content being viewed by a user on a 
real-time or near real-time basis. 
[0077] It is important to note that while the present in- 
vention has been described in the context of a fully func- 
tioning data processing system, those of ordinary skill 
In the art will appreciate that the processes of the 
present invention are capable of being distributed in the 
form of a computer readable medium of instructions and 
a variety of forms and that the present invention applies 
equally regardless of the particular type of signal bear- 
ing media actually used to carry out the distribution. Ex- 
amples of computer readable media include recordable- 
type media such a floppy disc, a hard disk drive, a RAM, 
and CD-ROMs and transmission-type media such as 
digital and analog communications links. 
[0078] The description of the present Invention has 
been presented for purposes of illustration and descrip- 
tion, but Is not Intended to be exhaustive or limited to 
the invention in the form disclosed. Many modifications 
and variations will be apparent to those of ordinary skill 
in the art. The embodiment was chosen and described 
in order to best explain the principles of the invention. 



the practical application, and to enable others of ordi- 
nary skill in the art to understand the invention for vari- 
ous embodiments with various modifications as are suit- 
ed to the particular use contemplated. 
5 [0079] The embodiment is now described in clause 
form where each numbered paragraph represents a 
separate clause. 

1 . A method for providing online connection service 
TO from a server to a client in a distributed data 

processing system, the method comprising the 
computer-implemented steps of: 

inserting selected advertisements Into a datast- 
ream for a session connecting the server with 
the client; and 

reducing a cost for the online connection serv- 
ice in response to the insertion of the selected 
advertisements into the datastream. 

2. A method for providing online connection service 
from a server to a client In a distributed data 
processing system, the method comprising the 
computer-implemented steps of: 

scanning, on the server, a datastream for a ses- 
sion connecting the server with the client; 
selecting advertisements based on the results 
of scanning the datastream; 
inserting the selected advertisements into the 
datastream; and 

reducing a cost for the online connection sen/- 
ice in response to the insertion of the selected 
advertisements into the datastream. 

3. The method of clause 2 further comprising: 
computing a cost for the online connection 

service that is inversely proportional to a number of 
selected advertisements that are inserted into the 
datastream. 

4. The method of clause 2 further comprising: 

selecting, on the client, an option for disabling 
the insertion of the selected advertisements in- 
to the datastream; 

sending the selected option to the server; 
ceasing the insertion of selected advertise- 
ments into the datastream; and 
accounting for a cost for subsequent online 
connection service in a normal manner. 

5. A method, for sending advertisements from a 
sen/er to a client in a distributed data processing 
system, the method comprising the computer-Im- 
plemented steps of: 

scanning, on the server, a datastream for a ses- 
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sion connecting the server with the client; 
generating a list of keywords from content with- 
in the datastream; 

selecting advertisements based on the gener- 
ated keyword list; and s 
inserting the selected advertisements into the 
datastream. 



6. The method of clause 5 wherein the step of scan- 
ning the datastream further comprises: 



10 



storing the datastream as a cached datast- 
ream; and 

reading the cached datastream as input for 
scanning the datastream for the session. is 

7. The method of clause 5 wherein the content of 
the datastream comprises Uniform Resource Iden- 
tifiers (URIs) and content within Hypertext Trans- 
port Control Protocol (HTTP) response messages 20 
received in response to HTTP requests directed to 
the URIs. 

8. The method of clause 5 wherein the step of gen- 
erating a list of keywords comprises deriving a list 25 
of keywords from a moving window of content within 

the datastream. 

9. The method of clause 8 wherein the moving win- 
dow of content is a variable number of downloaded 30 
Web pages. 



selecting advertisements further comprises: 

determining information that characterizes the 
session; and 

selecting an advertisement from the set of ad- 
vertisements in a subcategory of the matched 
category based on the session characterization 
information. 

15. The method of clause 14 wherein the session 
characterization Information is selected from one or 
more types of Information in a group comprising: 

a type of computer platform for the client; 
a current time of day for the session; 
a current season of the year for the session; 
an amount of connection bandwidth for the ses- 
sion; and a geographic location for the client. 

16. The method of clause 5 wherein the step of se- 
lecting advertisements further comprises basing 
the selection on session characterization informa- 
tion selected from one or more types of information 
in a group comprising: 

a type of computer platform for the client; 
a current time of day for the session; 
a current season of the year for the session; 
an amount of connection bandwidth for the ses- 
sion; and 

a geographic location for the client. 



10. The method of clause 8 wherein a size of the 
moving window of content is a variable number of 
bytes in the datastream. 3S 

11. The method of clause 8 wherein a size of the 
moving window is a function of a size of the keyword 
list. 

40 

1 2. The method of clause 5 wherein the step of gen- 
erating a list of keywords further comprises compil- 
ing a list of keywords that most frequently occur 
within a variable amount of time within the session. 

45 

1 3. The method of clause 5 wherein the step of se- 
lecting advertisements further comprises: 

comparing the generated keyword list against 
a set of predetermined keyword lists, wherein 50 
each predetermined keyword list represents a 
category for a set of advertisements; 
determining a relative match as a result of the 
comparison; and 

selecting an advertisement from the set of ad- 55 
vertisements in the matched category 

14. The method of clause 13 wherein the step of 



1 7. The method of clause 5 wherein the step of gen- 
erating a list of keywords further comprises: 

analyzing the datastream to identify a set of 
terms; and 

applying an adaptive weighting scheme in 
which a subset of terms in the set of terms are 
given greater emphasis as a keyword in the list 
of keywords. 

18. The method of clause 7 wherein the content of 
the datastream comprises Uniform Resource Iden- 
tifiers (URIs) and terms within the URIs are weight- 
ed with greater value than other terms not within the 
URIs. 

1 9. The method of clause 1 7 wherein the content of 
the datastream comprises metadata, and wherein 
the metadata is weighted with greater value than 
other content in the datastream. 

20. The method of clause 17 wherein the adaptive 
weighting of terms is a function of a time at which 
the terms appear in the datastream such that more 
recent terms are weighted more heavily that more 
distant terms. 
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21. A method for receiving advertisements at a cli- 
ent from a server in a distributed data processing 
system, the method comprising the computer-im- 
plemented steps of: 

5 

scanning, on the client, a datastream for a ses- 
sion connecting the client with the server; 
generating a list of keywords from content with- 
in the datastream; 

sending the generated list of keywords to the to 
server; and 

receivirig advertisements within the datast- 
ream, wherein the advertisements comprise 
content semantically related to the generated 
list of keywords. 

22. The method of clause 21 further comprising: 

caching advertisements placed as hidden ad- 
vertisement content within markup language tags in 
the datastream, wherein the hidden advertisement 20 
content is not displayed by the browser when a 
markup language document containing the hidden 
advertisement content is displayed. 

23. The method of clause 22 further comprising: 25 

retrieving the hidden advertisement content 
during periods of user inactivity; and 
displaying advertisements derived from the hid- 
den advertisement content on a display device 50 
of the client. 

24. A distributed data processing system for provid- 
ing online connection service from a server to a cli- 
ent, the distributed data processing system com- 55 
prising: 

inserting means for inserting selected adver- 
tisements into a datastream for a session con- 
• necting the server with the client; and 40 
reducing means for reducing a cost for the on- 
line connection service in response to the in- 
sertion of the selected advertisements into the ' 
datastream. 

45 

25. A distributed data processing system for provid- 
ing online connection service from a server to a cli- 
ent, the distributed data processing system com- 
prising: 

50 

scanning means tor scanning, on the server, a 
datastream for a session connecting the server 
with the client; 

selecting means for selecting advertisements 
based on the results of scanning the datast- 55 
ream; 

inserting means tor inserting the selected ad- 
vertisements into the datastream; and 



reducing means for reducing a cost tor the on- 
line connection service in response to the in- 
sertion of the selected advertisements into the 
datastream. 

26. The data processing system of clause 25 further 
comprising: 

computing means for computing a cost tor the 
online connection service that is inversely propor- 
tional to a number of selected advertisements that 
are inserted into the datastream. 

27. The data processing system of clause 25 further 
comprising: 

selecting means for selecting, on the client, an 
option for disabling the insertion of the selected 
advertisements into the datastream; 
sending means for sending the selected option 
to the server; 

ceasing means for ceasing the insertion of se- 
lected advertisements into the datastream; and 
accounting means for accounting for a cost for 
subsequent online connection service in a nor- 
mal manner. 

28. A distributed data processing system for send- 
ing advertisements from a server to a client, the dis- 
tributed data processing system comprising: 

scanning means for scanning, on the server, a 
datastream tor a session connecting the server 
with the client; 

generating means for generating a list of key- 
words from content within the datastream; 
selecting means for selecting advertisements 
based on the generated keyword list; and 
inserting means for inserting the selected ad- 
vertisements Into the datastream. 

29. The data processing system of clause 28 
wherein the scanning means for scanning the da- 
tastream further comprises: 

storing means for storing the datastream as a 
cached datastream; and 
reading means for reading the cached datast- 
ream as input for scanning the datastream for 
the session. 

30. The data processing system of clause 28 
wherein the content of the datastream comprises 
Uniform Resource Identifiers (URls) and content 
within Hypertext Transport Control Protocol (HTTP) 
response messages received in response to HTTP 
requests directed to the URls. 

31. The data processing system of clause 28 
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wherein the generating means for generating a list 
of keywords comprises deriving means for deriving 
a list of keywords from a moving window of content 
within the datastream. 

5 

32. The data processing system of clause 31 
wherein the moving window of content is a variable 
number of downloaded Web pages. 

33. The data processing system of clause 31 io 
wherein a size of the moving window of content is 

a variable number of bytes in the datastream. 

34. The data processing system of clause 31 
wherein a size of the moving window is a function 
of a size of the keyword list. 

35. The data processing system of clause 28 
wherein the generating means for generating a list 

of keywords further comprises compiling means for 20 
compiling a list of keywords that most frequently oc- 
cur within a variable amount of time within the ses- 
sion. 

36. The data processing system of clause 28 2S 
wherein the selecting means for selecting adver- 
tisements further comprises: 

comparing means for comparing the generated 
keyword list against a set of predetermined key- 30 
word lists, wherein each predetermined key- 
word list represents a category for a set of ad- 
vertisements; 

determining means for determining a relative 
match as a result of the comparison; and 
selecting means for selecting an advertisement 
from the set of advertisements in the matched 
category. 

37. The data processing system of clause 36 
wherein the selecting means for selecting adver- 
tisements further comprises: 

determining means for determining means for 
determining information that characterizes the 45 
session; and 

selecting means for selecting an advertisement 
from the set of advertisements in a subcategory 
of the matched category based on the session 
characterization information. so 

38. The data processing system of clause 37 
wherein the session characterization information is 
selected from one or more types of information in a 
group comprising: ss 

a type of computer platform for the client; 
a current time of day for the session; 



a current season of the year for the session; 
an amount of connection bandwidth for the ses- 
sion; and 

a geographic location for the client. 

39. The data processing system of clause 28 
wherein the selecting means for selecting adver- 
tisements further comprises selecting means for 
basing the selection on session characterization in- 
formation selected from one or more types of infor- 
mation in a group comprising: 

a type of computer platform for the client; 
a current time of day for the session; - 
a current season of the year for the session; 
an amount of connection bandwidth for the ses- 
sion; and 

a geographic location for the client. 

40. The data processing system of clause 28 
wherein the generating means for generating a list 
of keywords further comprises: 

analyzing means for analyzing the datastream 
to identify a set of terms; and 
applying means for applying an adaptive 
weighting scheme in which a subset of terms in 
the set of terms are given greater emphasis as 
a keyword in the list of keywords. 

41. The data processing system for clause 40 
wherein the content of the datastream comprises 
Uniform Resource Identifiers (URIs) and terms 
within the URIs are weighted with greater value than 
other terms not within the URIs. 

42. The method of clause 40 wherein the content of 
the datastream comprises metadata, and wherein 
the metadata is weighted with greater value than 
other content in the datastream, 

43. The method of clause 40 wherein the adaptive 
weighting of terms is a function of a time at which 
the terms appear in the datastream such that more 
recent terms are weighted more heavily than more 
distant terms. 

44. A distributed data processing system for receiv- 
ing advertisements at a client from a server, the dis- 
tributed data processing system comprising: 

scanning means for scanning, on the client, a 
datastream for a session connecting the client 
with the server; 

generating means for generating a list of key- 
words from content within the datastream; 
sending means for sending the generated list 
of keywords to the server; and 
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receiving means for receiving advertisements 
within the datastream, wherein the advertise- 
ments comprise content semantically related to 
the generated list of keywords. 

5 

45. The data processing system of clause 44 further 
comprising; 

caching means for caching advertisements 
placed as hidden advertisement content within 
markup language tags in the datastream. wherein 
the hidden advertisement content is not displayed 
by the browser when a markup language document 
containing the hidden advertisement content is dis- 
played. 

75 

46. The data processing system of clause 45 further 
comprising: 

retrieving means for retrieving the hidden ad- 
vertisement content during periods of user in- 20 
activity; and 

displaying means for displaying advertise- 
ments derived from the hidden advertisement 
content on a display device of the client. 

25 

47. A computer program product In a computer 
readable medium for use in a distributed data 
processing system for providing online connection 
service from a server to a client, the computer pro- 
gram product comprising: 50 

first instructions for inserting selected adver- 
tisements into a datastream for a session con- 
necting the server with the client; and 
second instructions for reducing a cost for the 3S 
online connection service in response to the in- 
sertion of the selected advertisements Into the 
datastream. 

48. A computer program product in a computer 
readable medium for use in a distributed data 
processing system for providing online connection 
service from a server to a client, the computer pro- 
gram product comprising: 

45 

first instructions for scanning, on the server, a 
datastream for a session connecting the server 
with the client; 

second instructions for selecting advertise- 
ments based on the results of scanning the da- 50 
tastream; 

third instructions for Inserting the selected ad- 
vertisements Into the datastream; and 
fourth instructions for reducing a cost for the on- 
line connection service In response to the In- 5S 
sertion of the selected advertisements Into the 
datastream. 



49. The computer program product of clause 48 fur- 
ther comprising: 

instructions for computing a cost for the online 
connection service that Is Inversely proportional to 
a number of selected advertisements that are in- 
serted into the datastream. 

50. The computer program product of clause 48 fur- 
ther comprising: 

instructions for selecting, on the client, an op- 
tion for disabling the Insertion of the selected 
advertisements into the datastream; 
Instructions for sending the selected option to 
the sen/er; 

instructions for ceasing the Insertion of select- 
ed advertisements into the datastream; and 
Instructions for accounting for a cost for subse- 
quent online connection service in a normal 
manner. 

51. A computer program product In a computer- 
readable medium for use in a distributed data 
processing system for sending advertisements from 
a server to a client, the computer program product 
comprising: 

first Instructions for scanning, on the server, a 
datastream for a session connecting the server 
with the client; 

second Instructions for generating a list of key- 
words from content within the datastream; 
third instructions for selecting advertisements 
based on the generated keyword list; and 
fourth instructions for insertirig the selected ad- 
vertisements into the datastream. 

52. The computer program product of clause 51 
wherein the first instructions for scanning the da- 
tastream further comprises: 

Instructions for storing the datastream as a 

cached datastream; and 

instructions for reading the cached datastream 

as input for scanning the datastream for the 

session, 

53. The computer program product of clause 51 
wherein the content of the datastream comprises 
Uniform Resource Identifiers (URIs) and content 
within Hypertext Transport Control Protocol (HTTP) 
response messages received in response to HTTP 
requests directed to the URIs. 

54. The computer program product of clause 51 
wherein the second instructions for generating a list 
of keywords comprises Instructions for deriving a 
list of keywords from a moving window of content 
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within the datastream. 

55. The computer program product of clause 54 
wherein the moving window of content is a variable 
number of downloaded Web pages. s 

56. The computer program product of clause 54 
wherein a size of the moving window of content is 
a variable number of bytes in the datastream. 

10 

57. The computer program product of clause 54 
wherein a size of the moving window is a function 
of a size of the keyword list. 

58. The computer program product of clause 51 '5 
wherein the second instructions for generating a list 

of keywords further comprises instructions for com- 
piling a list of keywords that most frequently occur 
within a variable amount of time within the session. 

20 

59. The computer program product of clause 51 
wherein the third instructions for selecting adver- 
tisements further comprises: 

instructions for comparing the generated key- 2S 
word list against a set of predetermined key- 
- word lists, wherein each predetermined key- 
word list represents a category for a set of ad- 
vertisements; 

instructions for determining a relative match as oo 
a result of the comparison; and 
instructions for selecting an advertisement from 
the set of advertisements in the matched cate- 
gory. 

35 

60. The computer program product of clause 59 
wherein the instructions for selecting an advertise- 
ment from the set of advertisements in the matched 
category further comprises: 

40 

instructions for determining information that 
characterizes the session; and 
instructions for selecting an advertisement from 
the set of advertisements in a subcategory of 
the matched category based on the session ^5 
characterization information. 

61. The computer program product of clause 60 
wherein the session characterization information is 
selected from one or more types of information in a so 
group comprising: 

a type of computer platform for the client; 
a current time of day for the session; 
a current season of the year for the session; 55 
an amount of connection bandwidth for the ses- 
sion; and 

a geographic location for the client. 



62. The computer program product of clause 51 
wherein the third Instructions for selecting adver- 
tisements further comprises instructions for basing 
the selection on session characterization informa- 
tion selected from one or more types of Information 
In a group comprising: 

a type of computer platform for the client; 
a current time of day for the session; 
a current season of the year for the session; 
an amount of connection bandwidth for the ses- 
sion; and 

a geographic location for the client. 

63. The computer program product of clause 51 
wherein the second instructions for generating a list 
of keywords further comprises: 

instructions for analyzing the datastream to 
Identify a set of terms; and 
Instructions for applying an adaptive weighting 
scheme in which a subset of terms in the set of 
terms are given greater emphasis as a keyword 
in the list of keywords. 

64. The computer program product of clause 63 
wherein the content of the datastream comprises 
Uniform Resource Identifiers (URIs) and terms 
within the URIs are weighted with greater value than 
other terms not within the URIs. 

65. The computer program product of clause 63 
wherein the content of the datastream comprises 
metadata, and wherein the metadata is weighted 
with greater value than other content In the datast- 
ream. 

66. The computer program product of clause 63 
wherein the adaptive weighting of terms Is a func- 
tion of a time at which the terms appear in the da- 
tastream such that more recent terms are weighted 
more heavily than more distant terms, 

67. A. computer program product for receiving ad- 
vertisements at a client from a server in a distributed 
data processing system, the computer program 
product comprising: 

first instructions for scanning, on the client, a 
datastream for a session connecting the client 
with the sen/er; 

second instructions for generating a list of key- 
words from content within the datastream; 
third instructions for sending the generated list 
of keywords to the server; and 
fourth Instructions for receiving advertisements 
within the datastream, wherein the advertise- 
ments comprise content semantically related to 
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the generated list of keywords. 

68. The computer program product of clause 67 fur- 
ther comprising: 

instructions for caching advertisements 
placed as hidden advertisement content within 
markup language tags in the datastream, wherein 
the hidden advertisement content is not displayed 
by the browser when a markup language document 
containing the hidden advertisement content is dis- 
played. 

89. The computer program product of clause 68 fur- 
ther comprising: 

instructions for retrieving the hidden advertise- 
ment content during periods of user inactivity; 
and 

instructions for displaying advertisements de- 
rived from the hidden advertisement content on 
a display device of the client. 

[0080] In summary a method and apparatus for adap- 
ttvely targeting advertisements to a specific client com- 
puter from a server within a distributed data processing 
system is provided. As a user of the client browses the 
World Wide Web, the material that is downloaded to the 
client constitutes a datastream. At some location during 
the routing of the datastream, either on the server or at 
the client, the datastream is scanned to generate a list 
of keywords that are present within the datastream. The 
datastream may be analyzed in real-time or cached and 
analyzed on a delayed basis. The generated list of key- 
words represents a summary of the content that ap- 
pears to be the focus of interest of the user The key- 
words are compared against a database of advertise- 
ments, and the server selects an advertisement that 
matches the user's area of interest In comparison to the 
analysis of the user's browsing history. The selected ad- 
vertisement is then inserted into the datastream to be 
routed to the client. In consideration for viewing targeted 
advertisements and to entice a Web viewer to allow the 
monitoring of a datastream so that targeted advertise- 
ments may be placed into the datastream, a Web viewer 
may receive online connection service for free, for a re- 
duced cost, at a premium level of service, or for other 
some other value, such as frequent viewer credits that 
may be exchanged for goods and services. 



Claims 

1. A method for providing online connection service 
from a server to a client in a distributed data 
processing system, the method comprising the 
computer-implemented steps of: 

inserting selected advertisements into a datast- 



ream for a session connecting the server with 
the client; and 

reducing a cost for the online connection serv- 
5 ice in response to the insertion of the selected 

advertisements into the datastream. 

2. A method of claim 1 further comprising the steps of: 

10 scanning, on the server, the datastream for a 

session connecting the server with the client; 
and 

selecting advertisements based on the results 
15 of scanning the datastream. 

3. The method of claim 2 further comprising: 

computing a cost for the online connection 
service that Is inversely proportional to a number of 
20 selected advertisements that are inserted into the 
datastream. 

4. The method of claim 2 further comprising: 

26 selecting, on the client, an option for disabling 

the insertion of the selected advertisements in- 
to the datastream; 

sending the selected option to the server; 

30 

ceasing the insertion of selected advertise- 
ments into the datastream; and 

accounting for a cost for subsequent online 
35 connection service in a normal manner. 

5. A method for sending advertisements from a server 
to a client in a distributed data processing system, 
the method comprising the computer-implemented 

40 steps of: 

scanning, on the server, a datastream for a ses- 
sion connecting the server with the client; 

45 generating a list of keywords from content with- 

in the datastream; 

selecting advertisements based on the gener- 
ated keyword list; and 

so 

inserting the selected advertisements into the 
datastream. 

6. The method of claim 5 wherein the step of scanning 
55 the datastream further comprises: 

storing the datastream as a cached datast- 
ream; and 
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reading the cached datastream as input for 
scanning the datastreann for the session. 

7. The method 6t clainn 5 wherein the content of the 
datastreann comprises Uniform Resource Identifi- 
ers (URIs) and content within Hypertext Transport 
Control Protocol (HTTP) response messages re- 
ceived in response to HTTP requests directed to the 
URIs. 

8. The method of claim 5 wherein the step of generat- 
ing a list of keywords comprises deriving a list of 
keywords from a moving window of content within 
the datastream. 

9. The method of claim 8 wherein the moving window 
of content is a variable number of downloaded Web 
pages. 

1 0. The method of claim 8 wherein a size of the moving 
window of content is one or more of: 

(i) a variable number of bytes in the datastream; 

(ii) a function of a size of the keyword list. 

1 1 . The method of claim 5 wherein the step of selecting 
advertisements further comprises: 

comparing the generated keyword list against 
a set of predetermined keyword lists, wherein 
each predetermined keyword list represents a 
category for a set of advertisements; 

determining a relative match as a result of the 
comparison; and 

selecting an advertisement from the set of ad- 
vertisements in the matched category. 

12. A distributed data processing system for providing 
online connection service from a server to a client, 
the distributed data processing system comprising: 

inserting means for inserting selected adver- 
tisements into a datastream for a session con- 
necting the server with the client; and 



datastream for a session connecting the server 
with the client; 

generating means for generating a list of key- 
5 words from content within the datastream; 

selecting means for selecting advertisements 
based on the generated keyword list; and 

TO inserting means for inserting the selected ad- 

vertisements into the datastream. 

14, A distributed data processing system for receiving 
advertisements at a client from a server, the dlstrib- 
uted data processing system comprising: 

scanning means for scanning, on the client, a 
datastream for a session connecting the client 
with the server; 

20 

generating means for generating a list of key- 
words from content within the datastream; 

sending means for sending the generated list 
25 of keywords to the server; and 

receiving means for receiving advertisements 
within the datastreann, wherein the advertise- 
ments comprise content semantically related to 
30 the generated list of keywords. 

15. A computer program product in a computer reada- 
ble medium for use in a distributed data processing 
system for providing online connection service from 

35 a server to a client, the computer program product 
comprising: 

first instructions for inserting selected adver- 
tisements into a datastream for a session con- 
40 necting the server with the client; and 

second instructions for reducing a cost for the 
online connection service in response to the in- 
sertion of the selected advertisements into the 
45 datastream. 



reducing means for reducing a cost for the on- 
line connection service in response to the in- so 
sertion of the selected advertisements into the 
datastream. 



13. A distributed data processing system for sending 
advertisements from a server to a client, the distrib- 55 
uted data processing system comprising: 

scanning means tor scanning, on the server, a 
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